Confidence intervals for policy evaluation in adaptive experiments

نویسندگان

چکیده

Significance Randomized controlled trials are central to the scientific process, but they can be costly. For example, a clinical trial may assign patients treatments that detrimental them. Adaptive experimental designs, such as multiarmed bandit algorithms, reduce costs by increasing probability of assigning promising over course experiment. However, because observations collected these methods dependent and their distribution is nonstationary, statistical inference challenging. We propose treatment-effect estimator has an asymptotically unbiased normal test statistic under straightforward, relatively weak conditions on adaptive design. This generalizes for variety parameters interest.

منابع مشابه

Bootstrapping with Models: Confidence Intervals for Off-Policy Evaluation

In many reinforcement learning applications executing a poor policy may be costly or even dangerous. Thus, it is desirable to determine confidence interval lower bounds on the performance of any given policy without executing said policy. Current methods for high confidence off-policy evaluation require a substantial amount of data to achieve a tight lower bound, while existing model-based meth...

متن کامل

Model-averaged confidence intervals for factorial experiments

We consider the coverage rate of model-averaged confidence intervals for the treatment means in a factorial experiment, whenwe use a normal linearmodel in the analysis.Modelaveraging provides a useful compromise between using the full model (containing all main effects and interactions) and a ‘‘best model’’ obtained by some model-selection process. Use of the full model guarantees perfect cover...

متن کامل

Confidence Intervals for Information Retrieval Evaluation

Information retrieval results are currently limited to the publication in which they exist. Significance tests are used to remove the dependence of the evaluation on the query sample, but the findings cannot be transferred to other systems not involved in the test. Confidence intervals for the population parameters provide query independent results and give insight to how each system is expecte...

متن کامل

Adaptive Markov Chain Monte Carlo Confidence Intervals

In Adaptive Markov Chain Monte Carlo (AMCMC) simulation, classical estimators of asymptotic variances are inconsistent in general. In this work we establish that despite this inconsistency, confidence interval procedures based on these estimators remain consistent. We study two classes of confidence intervals, one based on the standard Gaussian limit theory, and the class of so-called fixed-b c...

متن کامل

Adaptive Confidence Intervals for the Test Error in Classification

Sample Size n = 30 n = 100 n = 250 Data Set / Method ACI Yang Jiang ACI Yang Jiang ACI Yang Jiang ThreePt .976 .893* .914* .961 .552* .945 .961 .387* .930* Magic .955 .999* .983* .977* .991* .969* .972* .997* .974* Mam. .957 .989* .966 .962 .996* .964 .960 .995* .968 Ion. .947 .995* .985* .948 .996* .970* .970 .990* .970* Donut .968 .966 .908* .969 .851* .971* .973* .898* .966 Bal. .979* .996* ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the National Academy of Sciences of the United States of America

سال: 2021

ISSN: ['1091-6490', '0027-8424']

DOI: https://doi.org/10.1073/pnas.2014602118